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^D ■ Abstract 



We show that minimizing a convex function over the integer points of a bounded convex set is poly- 



?— j ■ nomial in fixed dimension. 



1 Introduction. 



One of the most important complexity results in integer programming states that minimizing a linear function 

over the integer points in a polyhedron is solvable in polynomial time provided that the number of integer 

^^ , variables is a constant. This landmark result due to Lenstra [10] has been generalized by Barvinok [2]: he 

^^ ' shows that one can even count the number of integer points in polytopes in fixed dimension. More recent 

extensions of Lenstra's algorithm apply to integer optimization problems associated with semi-algebraic 

sets and described by quasi-convex polynomials. The first polynomial time algorithm for minimizing a 

2 . quasi-convex polynomial over such sets in fixed dimension is due to Khachiyan and Porkolab [9]. Recent 

improvements of the complexity bound are given by Heinz [6] and by Hildebrand and Koppe [7] . 

In this paper, we drop the assumption that the functions describing the input to our problem are poly- 
nomials. Instead, we aim at minimizing a general convex or quasi-convex function over the integer points 
^ i in a bounded convex set in fixed dimension. The bounded convex set is defined by convex or quasi-convex 

lO ' functions as well. We assume that all the functions are encoded by means of evaluation oracles: queried on a 

C""^ , rational point, the evaluation oracles return the function values that the point attains. We assume that three 

further oracles are given, namely a continuous feasibility oracle, a separating hyperplane oracle and a linear 
integer optimization oracle. In order to realize them one needs additional assumptions on the functions. 

It is well known that Lenstra's algorithm can in principle be applied to any class of convex sets C in R" 
when n is a constant provided that we can determine an ellipsoidal approximation for every member in C 
efficiently. By an ellipsoidal approximation of a convex set we mean an ellipsoid E contained in the convex set 
such that a properly scaled version of E contains the convex set. (Typically, the scaling factor is 0{n)). The 
construction of such an ellipsoidal approximation is often performed by designing a shallow cut separation 
oracle (see, for instance, [H Section 3.3]). To the best of our knowledge it is not known how to construct 



JH ' ellipsoidal approximations for general convex sets in polynomial time even when the number of variables is 

■ - - fixed. This explains why general convex integer minimization problems with a fixed number of variables have 

not yet been extensively studied. 

We design a novel polynomial time algorithm for general convex integer minimization problems in fixed 
dimension that avoids going through the construction of ellipsoidal approximations. Instead we develop a 
cone-shrinking algorithm that from iteration to iteration produces smaller and smaller cones containing the 
convex set under consideration until we can reduce the original question to a series of similar problems in 
smaller dimensions. 

Our assumptions are as follows. Let /o, ■ • ■ , /m : R" i— > M be quasi-convex functions, i.e. for every a G R 
the level set {x G R" | fi{x) < a} is convex. Note that a convex function is also quasi-convex. For a given 
e G R>o, we define 

Ko := {x 6 R" I /.(x) < for alH = 0, . . . , m} , 
Ke ■■= {x e R" I Mx) < e for alH = 0, . . . , m} . 

Moreover, let _B, A, M G N be given numbers. We assume that Kq, K^ C [~B, _B]" and that \fi{x)\ < M for 
all X G {—B, i?]" and alH = 0, ..., m. For a point x G Q" the precision of x is the smallest integer q G N such 



that X has a representation x = (^, . . . , ^), where pj G Z for all j = 1, . . . , n. We are interested in rational 
points with a precision of at most A. We assume to have available three oracles. 

A-Feasibility Oracle. Given a polytope P in [—B, B]", the oracle returns a point x G P D K^ 
with precision at most A, or certifies that Pr\KQ does not contain a point with precision at most 

A. 

Separating Hyperplane Oracle. Given an affine space A and a point a G A, the oracle returns 
either that a G A'^ n^ or it returns a vector c G Q" with ||c||oo = 1 such that c^x < c^a for every 

X € KlDA. 

2 

Linear Integer Optimization Oracle. Given a polytope P and a linear objective function, the 
oracle returns a point in P n Z" with minimum objective function value, or certifies that P n Z" 
is empty. 

The polytope P that is part of the input in the A-Feasibility Oracle will always be defined as the 
intersection of the box [—B, P]" with an affine space. Polynomial time algorithms for realizing a A-Feasibility 
Oracle can be found in [3] and |llj . 

To the best of our knowledge there is no efficient algorithm for realizing a Separating Hyperplane Oracle in 
general. Rather, concrete realizations depend on properties of the functions /i, i = 0, . . . , tti. One particularly 
relevant case in which the Separating Hyperplane Oracle can be emulated is as follows. Let us assume that 
the functions /o,...,/m are convex. Moreover, let us assume that, for every x G [— P, P]" and for every 
i G {0, . . . ,m}, a subgradient of dfi{x) is known. Suppose now that an afRne space A and a point a G A 
are given. The question is to decide whether a G K,. Ci A, or - if not - to find a hyperplane that separates 
a from K^ n A. We start by checking whether a G K^ f) A. This can be done by simply substituting a into 
the functions fi, i — 0, . . . ,m. Let us assume that a ^ K^ D A. Then there exists one of the functions fi, say 
/o, such that /o(a) > £ > |. We take an element g G dfo{a). Note that g is the normal vector of a tangent 
hyperplane H of the epigraph of /o at the point (a, /o{a)). Next we shift H such that it contains the point 
a. Let the resulting hyperplane be H'. Then P' n A is a separating hyperplane. 

For a realization of the Linear Integer Optimization Oracle we refer again to the paper of Lenstra [TU] . 

We emphasize that the parameter s does not affect the number of iterations of our cone-shrinking algo- 
rithm. In fact, from now on we assume that e is fixed. Of course, it plays a role in the realizations of our 
oracles. Our main contribution is stated in the theorem below. 

Theorem 1.1. Let /o,...,/m, Kq, K^, B, and A he as above. In polynomial time in log(P) and log(A) 
either one can find a point z G Kg, n Z", or show that Kq n Z" = 0. 

Theorem 11.11 allows us to minimize any quasi-convex function /o over the integer points of a convex set 
L := {x G [—B, P]" I fi{x) < 0, for all i = 1, . . . , m} in polynomial time, when n is fixed. An approximate 
solution of the problem min{/o(a;) \ x € LCi Z"} can be computed by binary search in the interval [—M, M]. 
This follows since, for any 7 G [— M, M], Theorem 1 1.1 1 can be applied to the set {x G [— P, P]" | /o(a;) ~ 7 ^ 
O7 fi(,x) < 0, for alH = 1, . . . , m} instead of Kq. We thus derive the following corollary. 

Corollary 1.2. Let /o,...,/™; B, A, and M be as above. In polynomial time in log(P), log(A) and 
log(Af) either one can find a point z G Z" .such that fQ{z) < min{/o(a;) | x G Z" and fi{x) < e for all i — 
1, . . . , m} + e, or show that the problem min{/o(a;) | a; G Z" and fi{x) < for all i = 1, . . . , m} is infeasible. 

In the next section, we introduce the notation and we prove statements that are needed to show Theo- 
rem 11.11 Section [3] contains the proof of Theorem 11.11 Section |4] describes a straightforward generalization 
of our algorithm to the mixed integer case. 

2 Auxiliary lemmata. 

For a set S* C M" we denote by aff(5') the afSne hull of 5, by conv(S') the convex hull of 5, by int(5) the 
interior of S*, by dim(S') the dimension of the smallest affine space containing 5*, and by volj(5'), j' = 1, . . . , n, 
the Lebesgue measure of S with respect to a j-dimensional affine subspace containing it. We omit the 
subscript and simply write vol(S') whenever the dimension is clear from the context. For two sets S,T C M" 



we denote by S + T -.^ {s + t : s e S, t £ T} the Minkowski sum of S and T. When S is bounded, the set 
S — S := {x — y : X, y e S} is called the difference body of S. If M & R"^" is a matrix, then det(M) denotes 
the determinant of AI. 

In the remainder of this section we present five lemmata. Lemmata 12.21 and 12.51 are needed to prove 
Lemma 12.61 Lemmata 12.11 12.41 and 12.61 are used in the next section to prove Theorem 11.11 The following 
lemma states that the convex hull of the integer points of an n-dimensional closed convex set is lower- 
dimensional whenever its volume is sufficiently small. 

Lemma 2.1. Let K C M" he a closed convex set such that ^io\{K) < -^. Then dim(conv(ii' n Z")) < n — 1. 

Proof. For the purpose of deriving a contradiction assume that there exist n + 1 affinely independent points 
uo,...,w„ e i^nZ". Then vol(conv({uo,...,u„})) = ^|det(ui -vo,...,Vn - vo)\ > ^. D 

In the subsequent lemma we define for every n-dimensional closed convex set K a corresponding set K 
that is contained in K. The set K has the property that the Minkowski sum of K and a certain scaling of 
the difference body of X is a subset of K. Then, Lemma 12.21 gives an outer approximation of K and an inner 
approximation of K in terms of a certain ellipsoid. Consequently, this ellipsoid can be used to approximate 
K\K. Furthermore, we always have that K ^ ^. 

Lemma 2.2. Let K C M" he an n-dimensional closed convex set, and let 

K ■.:^ {xe~" 



^ -{K-K)C1K 

Then there exists an ellipsoid E C M" and a point c ^ K such that c + -^E C K and K <Z c + nE. 

Proof. By John's characterization of inscribed ellipsoids of maximal volume (see John [8j and Ball pLj), there 
exists an ellipsoid E centered at the origin, and a point c such that c + EcKcc + nE. By the definition 
of the difference body K ~ K, it follows that 2E = E-EcK-K(lnE-nE = 2nE. This implies 
^{K-K)c\E and thus ^E + ^{K - K) C E C K - c. Hence, (c+ ^E) + ^{K-K)cK. This implies 
that c + ^E C K. In particular, c e K. D 

Remark 2.3. // K C K" is a polytope, then the set K as defined in the previous lemma can be computed 
explicitly. For that, let K ~ {x G R" | ajx < hi, for i — 1, . . . ,m} be represented hy facet- defining inequali- 
ties. Then, for all i = 1, . . . ,m, we define pi := bi — Jiim{ajx \ x G K}, i.e. the width of K with respect to 
Qi. Since for the difference body K — K it holds that m.ax{ajx \ x (z K ~ K} — minja^a; \ x G K — K^ = 2pi 
for all i, it follows that K — {x G R" | aJx < bi — -j-pi, for i — 1, . . . , m}. 



For the two sets K and K defined in Lemma 12. 2| we prove next that when intersecting K with a half- 
space containing a point of K on its boundary, the volume of this intersection is guaranteed to decrease by 
a constant factor that is only dependent on the dimension. 

Lemma 2.4. Let K C R" be an n-dimensional closed convex set, and let K he defined as in Lemma \2.'A 
Furthermore, let x* d K and let iJ+ he a half-space with x* on its boundary. Then 

voi(Kni/+)<(^i--^)voi(if). 

Proof. The Brunn-Minkowski inequality (see, for instance, Gruber [Sj Theorem 8.5]) states that 2" vo\{K) < 
vo\{K — K). In addition, we have x* -f -^{K — K) C I-C . Furthermore, due to the central symmetry of the 
difference body K — K, we have 

voi((x* + ^(i^-i^))ni/+)^ivoi(-l(x-x) 



Hence, 



vol(i^ n H+) < voliK) \ vol [^^{K K)^ < [l - -J^) vol(i^). 



D 



The following lemma is one of the key ingredients of our proof of Theorem 11.11 It applies to two similar 
truncated second order cones: if one of them does not contain a point of a lattice, then the lattice points 
contained in the other truncated cone lie on a number of hyperplanes which only depends on n. 



Lemma 2.5. Let A be an arbitrary lattice in 

C := i .T e M" 
and let 



". Moreover, let 

n-i '■ 

Z — 1 J 



C 



X e 



n-l 
1=1 



Xi < a;„ < 1 



//int(C) n A = 0, then the lattice points C fl A lie on at most 4"n " hyperplanes. 

Proof. Our idea is to cover C with 4"n^" boxes. Then we show that the lattice points in each box lie on 
a single hyperplane. We note that, if a convex set L C R" satisfies L + A = M", then any translate of L 
contains at least one point of A. Furthermore, observe that, for any points vq, . . . , u„ G [0, A-]", it holds that 



X e 



i=l 



Xi{vi — Vq) and ^ K ^ — for alH = 1, . . . , n > C 



1 1 
2n'2n 



and for a sufficiently small a > it holds 



rill 


n-l 


_"2^' 2^ 


X 



71 — 1 — a n 



C int(C). 



(1) 



(2) 



It is straightforward to check that the right hand side in ([T|) and the left hand side in ([2]) are translates. More 
precisely, the set 



1 

2n' 



2nJ 



2(K-a)-l 



1 

2n' 



2nJ 



2„ -" - L 2«' 2nJ '^ L „ > ^]> whcrc e„ denotes the n-th unit 
vector. Observe that C C [-2n, 2n)". Next we partition [-2n,2n)" into boxes. Let D := [-2n, 2n)"n ;^Z". 
Then the cardinality of D is 4"n^". Moreover, C C [— 2n, 2n)" C D + [0, :^]"'. Now assume that there exists 
a box d + [0, ^]", with d € D, that contains n + 1 affinely independent lattice points wq, . . . , w„, i.e. assume 
that Wo, . . . ,wil e A n (d + [0, ;fj]"). Then {a: e M" | x = X;r=i -^^C^i - '^o) and - i < A^ < i for ah i = 



1, . . . , n} + A = R". This, together with ([T]) and ©, contradicts int(C) n A 



D 



In order to apply Lemma 12.51 in our proof of Theorem 11.11 we will adapt it to the notation that will be 
used later and we will show that we can compute the hyperplanes efficiently. 

Lemma 2.6. Let A be an arbitrary lattice. Let P C R" be a {n — 1)- dimensional polytope, and let P be defined 
as in Lemma \2.2[ Furthermore, let y £ R" \ aff(P) such that int (conv({y}, P)) n A = 0. In polynomial time 
in the input size of P and y, we can construct hyperplanes containing all the lattice points conv({?/}, P) n A. 
The number of hyperplanes is at most 4"n'^". 

Proof. From Lemma l2.2l it follows that there exists a (n — l)-dimensional ellipsoid E and a point c G P such 
that c + E C P and P C c + 2{n — 1)E. In particular we can compute P in polynomial time (see Remark 
12. 3p and hence we can compute E and c in polynomial time (see [?| Theorem 3.3.3] or [H]). Moreover, 
there exists a bijective affine mapping A : R" ^^ R" such that A (conv ({j/},c + 2(n — l)i<^)) — C and 
A (conv ({y}, c + E)) = C, with C and C as in Lemma [2751 By applying Lemma [275] to C, C, and A' = A{A) 
it follows that we can place the lattice points conv({y}, P) n A onto at most 4"?t,^" hyperplanes. 

It remains to construct the hyperplanes. For that, we use again the notation of the previous Lemma 12.51 
and its proof. For every d & D there exists a hyperplane Hd such that A^^{d + [0, t-]") n A C H^. By [H 
Lemma 6.5.3], we can determine Hd explicitly in polynomial time. D 



3 Proof of Theorem 11.11 



Let us first outline the main steps of tfie proof. 

We start by applying the A-Feasibility Oracle to the polytope [—B, B]". Assume that the oracle returns 
a point y € K^. We then consider an arbitrary facet F of [—B,B]"-, and define the set Tq :— conv({y},F). 
The basic idea is to successively construct subsets Tq D Ti D T2 ... that satisfy vol(T'i+i) < vol(Ti) for 
all i. This subset construction is iterated until we either obtain a set Ti in which we can find an integer 
point z G Kg-, or the volume of one of the constructed sets is so small that we can apply Lemma [2.11 to 
reduce the n-dimensional problem to a (n — l)-dimensional problem. By our hypothesis of induction, the 
(n — l)-dimensional problem can be solved in polynomial time. 

Let us now explain how the construction of the sets Ti is implemented. Each set T^+i arises from the set Ti 
by intersecting Ti with a half-space as follows. We first define a certain scaling of F, say F, such that F C F. 
Next we employ the Linear Integer Optimization Oracle. If conv{{y}, F) fl Z" = 0, then Lemma [2.61 implies 
that we either find a point 2 G Tj n K^ D Z" by solving a constant number of lower-dimensional problems, 
or we know that Ti D Kq n Z" ~ 0. On the other hand, if conv({y}, F) n Z" ^ 0, then we compute a point 
X* € conv({y}, F) fl Z" closest to y with respect to the normal vector of aff(F). Let H* be the hyperplane 
parallel to aff(i^) and passing through x* . Then we use the Separating Hyperplane Oracle to determine a 
(n — 2)-dimensional hyperplane S* in H* separating x* from the level set H* n Ks. . In turn, S* is lifted 
to the (n — l)-dimensional hyperplane S := afF({?/}, S*). Let S^ be the half-space containing H* n K^ and 
having S as its boundary. We then define T^+i := Ti n S^ . Lemma HH] guarantees a sufficient decrease of the 
volume of Ti-|_i with respect to Ti. It remains to check the integer points between H* and the hyperplane 
parallel to H* and passing through y. For this, we employ Lemma [^TBI again. 



y ^^iii'z: 




Figure 1: Construction of the truncated cones in the proof of Theorem 11.1 



Proof of Theorem ] 1.1[ First, we apply the A-Feasibility Oracle to P = [—B, B]". If the oracle returns that 
there is no point in Kq, then Kq n Z" = 0. So let us assume that the oracle returns a point y G K^. 

We use induction on the dimension n. If n = 1, then wc just check whether \_y\ € K^ or [y] £ K^. In the 
following let us assume that n > 2, and that we can solve all lower-dimensional problems. Let i^i, . . . ,P2n 
be the facets of [-S, B]". Then 

2n 

[-i?,i?r=Uconv({y},F,). 
j=i 

The following procedure is applied to every facet of [— B, -B]". Hence let us only consider an arbitrary facet 
F e {Pi,...,F2„}. We define 

Qa-^F and Tq := conv({y}, Qo)- 

Let a denote the Euclidean distance of y to aff(F). Then the volume of Tq is proportional to the (n — 1)- 
dimensional volume of Qo times a. More precisely, vol(ro) = - vol„_i((5o)- In the following, we construct 



a sequence Tq D Ti Z> T2 .... The construction terminates either with a set Tk which contains an integer 
point z S Kg; we can find; or with the conclusion that no integer point in Tq fl Kq exists. Below we 
show that either way we need to perform at most 0{\og{B)) steps. Moreover, we show that the iterative 
construction of a set Ti^i from T, is performed in polynomial time, and that (Tq \ Tj) n Kq n Z" = 0, and 
vol(Ti_|_i) < (1 — 2~"(?T, — l)^"") vol(Tj) for all i. Since in each step the decrease of the volume is only 
dependent on n, it is guaranteed that for some k £ 0(log(i?)) it holds that vol(rfe) < -^. Then, due to 
Lemma [01 it follows that dim {Tk fl Z" ) < 71 — 1 , and we can easily determine whether Tk D Kq D Z" is empty 
or find a point in Tk O K^ n Z", by induction. 

The iterative construction is as follows. Let Qi and Ti be given. First we define the auxiliary polytopes 



Qi ■■= ix e 



■ + j-{Qi- Qi) C 0,; i and Ti := conv({y}, Qi) 

(see Figure [1] and Remark 12. 3p . Next we employ the Linear Integer Optimization Oracle to solve the linear 
integer program 

minh^x s.t. x eT^nZ", (3) 

where h is the normal vector of aff(i^) such that h^y < h^x for x £ F. We distinguish two cases. 

Case 1 The linear integer program ([3]) is infeasible. Then T^ n Z" = 0. By construction, we can apply 
Lemma [2^ to determine whether there exists an z e {Ti\fi)nKsr\Z'' or whether (Ti\fi)nKonZ'' = 0. This 
requires to solve at most k < 4"n^" subproblems of dimension n — 1. Let these subproblems be contained in 
the hyperplanes Hi, . . . , Hk- Then, for all j = 1, . . . , fc, we test the lower-dimensional sets 

Ti n Hj n Kq n z" 

for feasibility. By assumption of induction, all these problems can be solved in polynomial time. 

Case 2 The linear integer program ([3]) has an optimal solution x*. If x* S K^, then we are done. 
Otherwise, let H* := {x £ M" | h^x = h^x*}, i.e. the hyperplane containing x* and being parallel to aff(i^). 
We define 



q: 


^T,r\H* 


and 


T* 


^conv{{y},Q* 


Qt 


^T,nH* 


and 


f* 


^conv{{y},Q* 



Using the Separating Hyperplane Oracle with A = H* and a ~ x* , let S* C H* be a {n — 2)-dimensional 
hyperplane containing a:* and separating x* from if* n Kl . Next let S denote the unique (n — l)-dimensional 
hyperplane containing y and S* , i.e. S :— afl[({y}, S**). Furthermore, let S^ denote the half-space with 
boundary S, and containing H* n K^ . Then, due to the convexity of the level set, we observe 

(((T, \ Tt) \ H+) n Ko) C ((m \ Tn \ H+) n K.) = 0. (4) 

It remains to check for an improving integer point z G K^ within T* \ T*. For that, we apply Lemma 12.61 to 
T* and T* in the same way that we described in Case 1. If none of the corresponding subproblems returns 
a point z e KsD Z", then together with (g]) we know that {T \ S+) n Kg fl Z" = 0. We define 



ii+l 



:= Qi, n S+ and T., := T n S^ 



It holds that z ^ Kq for all z G (To\Tj+i)nZ". In particular, from Lemma [2^ it follows, that vol„_i(Qi+i) < 
(l-2-"(n-l)i-")vol„_i(Q,+i). Hence, vol(T,+i)<(l-2-"(n-l)i-")vol(r,). D 

4 Extension to the mixed integer setting. 

It is straightforward to extend Theorem 11.11 to the mixed integer setting with a constant number of integer 
variables zi, . . . , Zn and any number of continuous variables xi, . . . ,Xd. Simply replace any query with input 
z G Z" to the evaluation oracle /(•) by a call of a A- Feasibility Oracle applied to a fixed z* G Z" and 
returning the value min{/(z*,x) | {z*,x) G P}. 
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